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1. Introduction 


In the present note some aspects of the statistical description of macroscopic 
systems are discussed. What we are aiming at is a rational foundation, on a 
microscopic basis, of the theory of non equilibrium phenomena. However, this 
problem is only touched upon in the present work. For the characterization of 
macroscopic states non equilibrium thermodynamics uses observables which are 
closely related to the observables of equilibrium thermodynamics. For the com- 
plete characterization of macroscopic states in the sense of Ehrenfest’s postulate 
of determination [Ehrenfest, 1911], that is the postulate that the macroscopic 
state of the system at a given time determines the macroscopic state at every 
later time, it is in general not sufficient to use observables of the thermo- 
dynamic type. It is assumed that it is possible to obtain a complete set of 
observables by adding to the observables of the thermodynamic type observables 
of other types. Ehrenfest’s postulate refers to closed systems and for open systems 
it is obviously necessary to prescribe boundary conditions in order to make the 
characterization of the state complete. On a microscopic level the observables 
of the phenomenological theory are assumed to correspond to phase functions 
or to quantum mechanical operators. Obviously these functions and these oper- 
ators must be of a very peculiar nature as they are responsible for the macro- 
scopic properties of the system, in particular for its possible irreversible ap- 
pearance. 

The point of view taken in this note is that at least certain classes of mac- 
roscopic observables can be regarded as generators of generalized canonical dis- 
tributions—the concept canonical distribution or ensemble being used in the 
sense of [Gibbs, 1902]—in a phase space, which may be of the classical type 
or of a more general nature. For this purpose it seemed desirable to investigate 
the physical and mathematical properties of such generalized canonical distribu- 
tions in some detail. The content of this note is hence related both to the work 
of Grad on the statistical mechanics of systems with an arbitrary number of 
integrals of motion [Grad, 1952], and to the work of Minster on the trans- 
formation theory for the distributions of statistical mechanics [Miinster, 1953]. 
It should be noticed, however, that the phase functions which we consider are 
not assumed to be even approximate integrals of motion. In fact the develop- 
ments in this note are independent of dynamical considerations. The character 
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of the macroscopic observables as functions of the time will be discussed in a 
forthcoming paper. 

Asymptotic expansions of the probability distributions of macroscopic observ- 
ables in terms of a parameter characterizing in some way the size of the system 
are very closely related to the classical limit theorems for sums of independent 
random variables. In particular an expansion given by Cramér [Cramér, 1928] 
appears to be useful for the statistical mechanics of macroscopic systems. 

In the second paragraph of the note some basic phenomenological concepts are 
introduced. The general framework for the statistical description is presented in 
paragraph three. In paragraph four the macroscopic and microscopic descriptions 
are connected by the introduction of the generalized canonical distribution. The 
induced probability distributions for the macroscopic observables are discussed 
in paragraphs five, six and ten. Generalized thermodynamic relations are 
derived in paragraphs seven and eight. In paragraph nine stability conditions are 
derived from the generalized canonical distribution. Asymptotic expansions for 
the so called structure function V(-) are discussed in paragraph eleven. The struc- 
ture function corresponds in the classical case to the phase volume between 
two energy surfaces. In an appendix the mathematical structure of the generalized 
canonical distribution is investigated. 


2. Thermodynamic states 


Let the thermodynamic state of a given system be characterized by the values 
of a finite number of extensive variables (A!, ..., A")=A: 


Ae (ee Oa (1) 


the thermodynamic coordinates of the system. It is assumed that every thermo- 
dynamic state (1) is a possible equilibrium state if the system is subjected to 
appropriate constraints. The characterization of thermodynamic states by the 
variables A thus corresponds to some prescribed class of constraints. If the 
constraints are changed within this class there will in general occur a change 
of the systems state. However, it is not assumed in the definition of thermo- 
dynamic equilibrium that for fixed constraints the system will ultimately reach 
an equilibrium state. 

Illustrative examples of the extensive A-variables are energy, mass of sub- 
stance, electric charge, of the total system and of subsystems. 

The operational characterization of thermodynamic states by means of a pre- 
seribed class of constraints can be given a more precise meaning in the frame- 
work of statistical thermodynamics. From now on we shall consider the thermo- 
dynamics of the system as derived from a statistical model which will be de- 
scribed in the next paragraph. 


3. Statistical thermodynamics 


Let o denote the phase of the system which is macroscopically described by 
the variables A. We shall take the concept phase in a very wide sense. One 
may think of ¢ as representing e.g. the number of particles of different species, 
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their coordinates and conjugated momenta. One may also think of a quantum 
mechanical description of the system. In this case ¢ will correspond to some 
representation of the linear operators in the state vector space which correspond 
to observables of the system. The detailed structure of the phase is, however, 
of minor importance for the following development. The essential property of 


the phase space Z={f}, considered as an abstract space, is that it has a meas- 


ure w(d¢) defined on it, corresponding to an integral 


J x(0) (dd), (2) 


Z 


where «(¢) denotes an arbitrary function of the phase.! In general we have 
a (Z) = sin 22), 
Any w-continuous probability measure in Z has the form 


1 
exp {=o 0e)fe (dé), (3) 


where Boltzmann’s constant x has been introduced for convenience. The function 
o(¢), which is admitted to take the value + 0°, is subject only to the normal- 
ization condition 


| exp {—7 0 )}(@)=1 (4) 


4. Correspondence postulate 


The connection between the thermodynamic description and the phase space 
description is established by the following postulate. 

Every macroscopic constraint of the specified class (paragragh 2) is characterized 
by the values of n real variables (B,, ..., B,)=B: 


B=B=(f,, ---1Bn)- (5) 


1 The precise formulation of these statements is: The phase space is a measure space 
{Z, F, u(-)}. 


Here JF is a o-field of measurable subsets of Z, containing Z and the empty set ¢. The 
measure j/(-) is a G-additive, non negative set-function on F. Tn general (Z) = +co, In this 
case (+) is assumed to be o-finite. In the future we shall avoid these technical terms which 
are explained in every text on measure theory, see e.g. [Halmos, 1950]. In particular when 
speaking about arbitrary functions of the phase we shall always mean real functions meas- 
urable with respect to JF. It should be recognized that integration with respect to the measure 
u(-) will in general include ordinary integration with respect to the Liouville measure 

const. dpdgq and summations. In particular when a quantum mechanical description is used. 
it is natural to consider the integral (2) as primarily defined as a linear functional on the 
space of functions of the phase and the measure fu(-) as a derived quantity. 
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The domain of the B-variables, Dj; is a convex subset of the n-dimensional real 
space R".1 Every set of values of the B-variables determines a probability 
distribution P;(d¢) in Z. 

To every extensive variable A“ (paragraph 2) there corresponds a function of 
the phase a”(f). Subject to the probability distribution P3(d¢), a“ (¢) is a random 
variable. The values of this random variable are identified with the observed 
values of A”, given the constraint B=f. The thermodynamic state (paragraph 2) 
corresponding to the constraint B=, is characterized by the expected values 


of the random variables a” (C) 


a= Ena" = fa*(c) P,(d¢), (gaily ees n). (6) 
Z 


The probability measure P3(d¢) is of the form (3) with 
Gg((\== 8 .0h(C)— (Bye (7) 


that is P;(d¢) is a generalized canonical distribution. 

The following conditions a., b., c. must be satisfied for every permissible /, that 
ig tor every pe D,:* 

a. The function o3(f) defined by (7), must satisfy the normalization condi- 
tion (4) 


Lae 1 S 2 
exp {-3 } (| = | exp {= Baal (C) p(dl)< +0. (8) 


ry 


The normalization constant Y(6) will be called the Massieu-Planck function 
[Planck, 1932], [Miinster, 1959]. 
b. For arbitrary real constants /,, ..., Ay 


i 
[loner exp ie Bis «(0 (dl) < + 09, (9) 


“x 
Z 


c. The random variables a"(¢) are linearly independent, that is, no relation 
of the form 


Aya (fC) = Ag, (10) 


where Ao, A,,...,4n are constants (independent of ¢), holds with probability 
one in Z unless Aj= A, = ++ =A, =0. f 


If the functions @(£) are finite almost everywhere in Z, that is if 


atten cipe 1. (uae tenn) 

* This is equivalent to the statement: If B1€Ds and f2 € Dg, and if 0<p<1, then 
pPl+( —p)B2, which is the point with coordinates pf 1,+(1—p) 2u, is also couteined in 
Des that is, with any two points, Dg contains the line segment joining these points. 

Z Summation convention is applied for pairs of equal upper and lower indices; the 
summation goes from 1 to n. If the sum is restricted, this is explicitely indicated, e.g. 


3 See also appendix. 
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for every ¢ with the possible exception of a ¢-set with fu-measure zero, and this 
will usually be the case in the applications, the condition c. can be replaced by 
the generally stronger statement that no relation of the form (10) can hold al- 
most everywhere in Z. The latter statement is independent of 6. For the proot 
denote by F the ¢-set for which (10) is true. Condition c. states 


» 


il 
0= P;(F) = | exp [-sonte)f (Gia) 


F 


and the finiteness assumption implies |o,(¢)|< + co so that the non negative 
integrand does not vanish. But then we have necessarily 


which was to be proved. The meaning of c., is thus simply to exclude redundant 
variables. 

According to the postulate a constraint of the specified class determines a 
thermodynamic state. The converse is also true. The statistical characterization 
of the constraints in terms of generalized canonical distributions will of course 
be possible only under particular experimental conditions and with a limited 
degree of accuracy. As the functions a“(¢) are not necessarily integrals of 
motion the probability measure P;(d¢) will in general not be a stationary 
measure for a closed system, that is a system described dynamically by a time 
independent Hamiltonian, invariant under the time reversal operation. The meas- 
ure may be stationary for certain values of the parameters. 


5. The probability distribution of the extensive variables 


A possible value of the random variable a(¢) is a point in the n-dimensional 
real space R”. The phase measure w(df) and the functions a“(¢) induce a 
measure in Rk” by 


Vida .aa)=wicle(C\eda, w= 1,.:...0}, (11) 
so that for an arbitrary function f(a’, ...,@") on R” 
{f@, ...,a%) Vda? ... da") = { flat (0), ....0" (0) a (20). (12) 
Rr Z 


Occasionally we shall assume that V (d a ...da"), the structure function, is of 
the form 


V (da) ... da”) =exp fF po dat Hees (13) 


da... da” denoting the ordinary (Lebesgue) measure in R". ¥ 
Given the probability measure (3), (7) we want to determine the probability 

distributiom of the variables a@°(C),...,@°(¢) in &% This distribution can be 

determined from the cumulant generating function [Cramér, 1937] defined by 
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Os (+z) =log Hy exp {iz, a"} = log J exp {tz, a" (C)} Ps (dC). (14) 
Z 
Here 2z,,...,2Z, are real parameters. It follows at once from the definition of 


P;(d¢) that 
CO; (iz) =~ {¥(p)- Y (B—xiz)} (15) 


if the definition of Y(f) is extended to the complex plane. To obtain the prob- 
ability distribution, let D be a bounded subset of R” and define e(D,z) by 


C( D2) (Qn) *{ exp {—t2, 2° }da ...dz". (16) 
D 


Let F;(D) denote the probability of finding a value of a(¢) in D, then [Cra- 
mér, 1937] 
F;(D)= fe(D, 2) exp {Os (iz)¥dz, ..2 d2,. CU) 


Rr 


It is seen from (15) that the probability distribution F;(da'...da") is com- 
pletely determined by the Massieu-Planck function in the complex plane. On 
the other hand F,(da’ ... da”) can be expressed in terms of V (da’ ... da”) 


1 
P3ida@ ... da")=exp {-= [B.a* — ¥ (| VG@a da’). (18) 
Accepting (13), we obtain from (17) and (18) 


F;(da' ... da") =exp ~~ [bya - r(B)~ (a) du wnadan (19) 


1 : 

exp -5 Brat ¥(p)- gta =a) | exp {Cg (tz) —tz,a"}dz, ... dz, (20) 
Rn 

or 


1 
exp fer] 2a [ exp |—2 rp xi2)—Gy—xiznerll ae ate (21) 


Rn 


It follows from the conditions a. and b. of paragraph 4. that for arbitrary 
constants /,,...,An, Hp|A, a"|?< co, and by Schwarz’ inequality Hs A, a" ES, 
Thus the mean , 


Eg i, a" =A, Epa" =, oc", (22) 

and the variance 
Bg |A,a" — Eph, a’ P= xf” Ay dy; (23) 
exist for arbitrary real A,, ...,/,. The quantity xf” =x f is called the covari- 
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ance of a“(¢) and a’(¢), Covg(a", a’). Being the expected value of a non negative 
quantity (23) the quadratic form corresponding to xf” is non negative 


ff Ae 0: (24) 


The condition ¢. of paragraph 4. is equivalent to the statement that the quad- 
ratic form (24) is positive definite, that is 


xf AyA,=—0. implies, A, =--2=/15=0. (25) 


If not, there would exist a linear combination of the a“(¢) with vanishing 
variance (23), but that is equivalent to the fact that with probability one this 
linear combination is equal to its mean value, contrary to the hypothesis c. 

The existence of the mean values and covariances being established it follows 
from wellknown, easily verifiable properties of the cumulant generating function 
[Cramér, 1937] that 


ae pee CAB) 9 
atten wer ies ee 

n2 m2 
x ft" = Covp(a", a?) = 8 gy = —2 FB) (27) 


OU zy, Oy 


OPnO By 


6. Approximation to the probability distribution of the extensive variables 
obtained from the central limit theorem 


For the further discussion of the probability distribution of the random vari- 


ables a (C), ...,a"(f) it is convenient to introduce the variable 


py = alah (= a] (28) 


DG = 
Ve fh” at, oy 


where 2, are arbitrary constants, not all equal to zero. The variable x(¢) is 


standardized 
E;2=0, Egx?=1. (29) 


The cumulant generating function corresponding to x(¢) is 


oY 
Hy ere 6 
‘ rg) ] thx 
6 (tk) =log Ez exp tkx= —tk ohh tly ip)—¥( - i! (30) 
ey 
where al RCN De (31) 


and k is a real parameter. ~ 
If they exist, the cumulants of the distribution are defined as the derivatives 


of O(ik) taken at ik=0. The cumulants can be expressed as polynomials in 
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the moments of the distribution. We have for the first few, taking into account 
that x is standardized [Cramér, 1945] 


6g (0) =0 (32) 
6 (0)=1 (33) 
6® (0) = Hau” (34) 
6 (0) = Hax* — 3. (35) 


And combining this with (30) we find 


“° Y (6) 
es Ses SaN alas 36 
Hgx ~ Bp Xx aT Ly - OBn ap, ( ) 
ipo ee (37) 
at = == am Hol u OB, 2B, 0B. O By 


The cumulant generating function corresponding to the standardized Gauszian 
distribution 


d® (y) == exp {-30'}ay (38) 
V2 = 
. lan 
is log | d@(y) exp {iky}=5 (ik)’. (39) 


The second cumulant of the Gauszian distribution is equal to one, and all the 
others are zero. Thus the cumulants of an arbitrary standardized distribution 
describe in a systematic way its deviation from the Gauszian form. 

It is obviously not possible to procede further without making specific as- 
sumptions about the form of the Massieu-Planck function Y($) which deter- 
mines the probability distribution of «(¢). From the point of view of sta- 
tistical thermodynamics this amounts to a specification of the phase space Z 
and the functions a“(f). However, except for the case when the a(¢) happen 
to be sums of many stochastically independent and equally or almost equally 
distributed random variables we are not able to determine the form of Y ({). 
On the other hand, adopting an ‘“‘Kinsteinian point of view” [Hinstein, 1910], 
and considering the Massieu-Planck function Y(f) as an empirically determined 
thermodynamic quantity, we would expect Y(f), as an extensive variable, to 
be proportional to the size of the system. So, assuming that all the derivatives 
of the function with respect to the intensive parameters are of the same order 
of magnitude as the function itself, we would expect the r:th cumulant to be 

r/2 
of the order ~ est B)| : 
bs 
lants would then be small, and the Gauszian distribution would be a good ap- 
proximation for the random variable 2(f). Deviations from the Gauszian form 
could be determined from (36), (37). 


For macroscopic systems the higher order cumu- 
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A more precise formulation of the latter considerations is that Y (6) can be 
written in the form N Y,(f) corresponding to a possible decomposition of the 
system in a great number JN of, still macroscopic, subsystems which have iden- 
tical Massieu-Planck functions. The assumption that the subsystems are macro- 
scopic is necessary to assure that the Massieu-Planck function for the entire 
system is approximately equal to the sum of the corresponding functions for 
the subsystems. The assumption that the subsystem have identical Massieu-Planck 
functions can be replaced by various somewhat weaker homogenity conditions 
which we shall not state explicitly. They correspond to different formulations 
of the central limit theorem for sums of independent not equally distributed 
random variables [Gnedenko and Kolmogorov, 1954]. It is the first assumption, 
corresponding to stochastic independence of the subsystems, which is most dif- 
ficult to justify. What one would actually need is a central limit theorem for 
sums of dependent variables and such a theorem has not been proved at pres- 
ent.’ Taking the assumptions for granted we find that when the number of 
subsystems tends to infinity, the cumulant generating function for the distribu- 
tion of aw will for fixed k& converge to the form (39). It follows [Cramér, 
1945] that the distribution function for #(¢), ie. the integral of the probability 
density, will converge to the Gauszian form for every value of its argument. 
The purpose of this vague and heuristic discussion was only to indicate a way 
in which the problem of finding the asymptotic distribution of x for large sys- 
tems could possibly be traced back to the classical form of the central limit 
theorem for sums of independent variables. It is however possible that an actual 
derivation of the asymptotic distribution must procede along completely hetero- 
dox lines. Before discussing the Gauszian approximation to the probability dis- 
tribution of the extensive variables we shall derive some generalized thermodynamic 
relations. 


7. Thermodynamic relations 


According to the postulate in paragraph 4 a constraint of the specified class 
is characterized by a set of values of the intensive variables B= 6, Bb € Dz, and 
determines uniquely a thermodynamic state with coordinates «= H,a". Con- 
versely the thermodynamic state determines the constraint uniquely. The corre- 
sponding values f of the parameters B are obtained by inverting the relations (26) 

oY 
op" 

1 There are versions of the central limit theorem which consider sums of stochastically 
dependent random variables. But the dependence is of a very weak kind so that the vari- 
ables in the sum are asymptotically independent. A theorem of this type holds e.g. for the 
energy of one independent component of an assembly of non interacting systems when the 
probability distribution is microcanonical and the number of systems becomes large. A par- 
ticular case of this theorem was proved already by Rize els | Siaaeye 1878]. 

(A lee a aan ee) 


2 As is wellknown the non vanishing of the Jacobian age Pn ee is a sufficient condi- 
bE) ’ n 


(6) C= Ls aor W)e" (40) 


tion for the existense of an inverse transformation. However, the condition eee sufficient 
to assure the uniqueness of the inverse transformation (simple example yY=a ). The ee 
fact is forgotten in some texts on thermodynamics which at all discuss the ce of t 
inverse. The essential property used in the following proof is the convexity (24), (25), (27) 
of the thermodynamical potentials, that is the convex form of the thermodynamic surfaces. 
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This can be done uniquely for 


a (B1)= fe (82) (w=1,...,n)B1, B2EDz implies B1=/2. (4.1) 

OB OB, 
To see this let pl+p2 2 be two points in D3 and consider the joining segment, 
which is contained in D3; as Dz, is convex: B(p)=pfh1+(1—p)B2, O<p<l. 


The function f(p)= Y[f(p)] has the derivatives 


aY re cig eee Rey eee 
B( p), wo - amd. f° (p) = (Dap (Pip 
eBn ; 


with B(p)’=B1—f2. According to (24), (2é 
ways negative so that f’(0)>/f' (1) or 


AGO) 


Or 


); 


OY 91) 2* gol g1,—p9,)>0 


and it follows that 


for at least one value of uw. The statement (41) follows. 
Hence the Legendre transformation [e.g. Goursat, 1917] 


ay 
eB, 
BOY 
OB 


Cn = 


(B) 


Bu- 


is well defined and reversible 
(43) 


The system (42), (43) illustrates the thermodynamic signification of the Massieu- 
Planck function as S(«) can be identified with the entropy function of the 
system, being equal to the expected value of the probability exponent 


S (x) = Eg og = Eg (B, a" — Y]=B, a" — Y. (44) 


The following relation between the entropy and the Massieu-Planck function 
is a general property of the Legendre transformation 


ose y =F 
Ox ba" OB,OB, a3) 


ARKIV FOR FYSIK. Bd 17 nr 14 


The relation is an immediate consequence of the fact that 


ase. 
Cat 


Ba(ax) and eae 
OPy 
The values of the extensive variables A or of the intensive variables B are 
equivalent characterizations of a thermodynamic state and of the corresponding 
probability measure in the phase space. 
It follows exactly as above, as a consequence of the condition c¢. in para- 
graph 4., that it is possible to represent a thermodynamic state by the values 


(Bee ibe en ear) ORIN): (46) 


The corresponding Legendre transformation is 


BY a. . 
Pu au (u=1, :..,%) 
(47) 
OS 
Yea pol, 
w=1 ofa s 
and the inverse transformation 
pias A w=1,...,m) 
OBu 
(48) 
H=™ BV 
Se 2 ere a 


yor By 


The notation implies Y,= —S and Y,=Y. From (47) and (48) it follows that 
p=m pan 


a! = Zz ot d By — ye Buda", (49) 
1 


p= p=m+1 


and that for 0<l<m<n 


{— Ya, Gua oreg a} and om Bis Ls) wise’ Bm} (50) 


are connected by the Legendre transformations of m—J+1 variables 


aS) te ee ee ey i 
Ce ern ae ee Age ls (51) 
a & Pee 5 
Pa (4—ll,....4m), (—2)= br Gey (52) 
a OBu (u l m) ( 1) pear 0 Bu B, 


It follows from the general property (45) of the Legendre transformation 


weg a a) oe Yn = 
or a" 8,08, 


O7 . (A, val al ,.<..51)- (53) 
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8. Linear transformations of thermodynamic coordinates 


Thermodynamic constraints are expressed either as prescriptions for the values 
of the intensive B-variables or as linear conditions on the extensive A-variables, 
which obey conservation laws. A linear condition may have a particularly simple 
form in some coordinate system. We shall therefore consider non singular linear 
transformations of the extensive variables 


a’ = A (a — a) 
a = a + BY a 


A Bi = BY Ap = 6F. (54) 
The transformed entropy function S (%) is given by 
S (a) =S (w+ Ba), (55) 


and the corresponding intensive variables f, by 


@S(d) . : 
=a. 56 
6 at! p b ( ) 


Bu a 


The Massieu-Planck function Y (B) corresponding to (55) and (56) is 


Y (B) =S (a) — G" B,, = S («) — AX (a — ab) B, Bi = Y (B) + 08 By. (57) 


The corresponding transformation of the phasefunctions a, (¢) is 
a" (C) = At [a” (C) — a0]. (58) 
It follows directly from (57) and (58) that 
0 (6) = ¥ (B) + Bua (C)= ¥ (B) + Bua" (0), (59) 


which expresses the invariance of the probability measure under the combined 
transformation (58), (56), (57). The domain of permissible $-values is transformed 
by (56) into a domain Dg which is obviously also convex. 

It is clear that there is no ambiguity in expressing linear conditions on the 
A-variables in the form a” *'=---=q"=0. 


9, Relaxation of constraints 


A constraint of the prescribed class (paragraph 2) is characterized by the 
values of n parameters, either of the A-variables, or of the B-variables, or of 
a mixture of the A- and the B-variables. A different class of constraints is 
obtained if only a smaller number of parameters are specified, e.g. B,, ..., B, 
and A™*?, ..., A” with 0<l<m<n, and the variables A'*!, ..., A” are left 
undetermined. A priori we do not know whether a constraint of this type de- 
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termines a thermodynamic state or not. Having postulated a one to one cor- 
respondence between thermodynamic states and probability measures in the phase 
space we have to find out whether a probability exponent of the type 


lL 


yal Be 
Or Doped (C) 4 => par (C) (60) 


pw=1 p=m++1 


Q 

> 

ay 
I 


can determine a probability measure in Z. Obviously (60) defines a probability 
measure if and only if 


(Beet Us ic Os Bae ee pie Da (61) 
and then Cm VAG aries Pt Oscccas On Diaaaseenes Dn) (62) 


Let the constraint be given in the form 
BiB gn Diba SHO ee A Se (63) 
The condition (61) is then equivalent to 


7) 


VM a ects Die «ice es 6 a”")=$8,=0 (w=l+1, ...,m). (64) 


But (64) is the condition for Y, to be stationary considered as a function of 
at ..., x", the other variables being determined by the constraint (63). The 
stationary value is a minimum as follows directly from the fact that the second 
order differential is positive definite, this being a consequence of the assump- 
tion c. in paragraph 4. in the form (24), (25) and the relation (53) with m=n. 

Thus we can state: The constraint (63) determines a thermodynamic state if 
and only if Y, has a minimum when considered as a function of Ts nto 
under the given constraint. The minimum value is attained in this thermo- 
dynamic state. This is of course nothing but the thermodynamic stability con- 
dition, derived here from the postulate of paragraph 4. If in particular, only 
extensive variables are specified 1=0 and as Y)= —S, the condition for the 
existence of a thermodynamic state, is that S has a maximum as a function 


of the “free” variables. 


10. Marginal distributions of the extensive variables. 
The Gauszian approximation 


In this paragraph we shall conclude the discussion in paragraph 5. and para- 
graph 6. of the distribution of the extensive variables Gila), sap a (Ge As awe 
saw the n-dimensional probability distribution of these variables was obtained 
from the Massieu-Planck function by a Fourier transformation involving the 
cumulant generating function or equivalently the Massieu-Planck function (17), 
(15). If we are interested only in the distribution of some of the a’ (C):8 say 
a'*1(c), ...,a"(C) the other a” (¢):s being left unspecified, we put z,=-- ic as 
=z,=0 in (14) and (15) to obtain the cumulant generating function of 


ene 
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the corresponding marginal distribution. TH z,.1= ++: =2n,=0 we may replace y 
in (15) by Y,, and this function determines the (m —1)-dimensional probability 
distribution of a'*!(¢), ..., a@"(¢). In particular mean values and covariances are 
given by 
E,pat=2*8 (i=l 1 ean), (65) 
6B, 
oY 
Cov; (a’, a“) = — = (A, w=t+1, ..., Mm). (66) 
De Snes ee 
It follows from (53) that 
See fh Cov, (a", a’) = 0, (A, v=1+1 m) (67) 
water 0008 ot! : ie coe ek : 
GX, 


that is, the covariance is inverted by {2 Tae Ay hsb AM, mt. 
Hon Ou 
Let us return for’ a moment to the standardized random variable x(¢) in- 
troduced in paragraph 6, (28). Its distribution is determined from the cumulant 
generating function (30). If an homogenity assumption of the form discussed 
in paragraph 6 is accepted, the cumulant generating function can be expanded 
formally in a Taylor series in powers of the inverse square root of the “‘size”’ 
N of the system. The zero order term corresponds to the standardized Gauszian 
distribution function 


exp (—}2°). (68) 


To the Taylor expansion of the cumulant generating function there will then 
under fairly general conditions correspond an expansion of the frequency fune- 
tion f(x) for #(¢), the first two terms of which are 


1074 
if 6! 


V2 (4) 


f (2) = @ (=) — Fr 9 (a) + Hg (a) + 9 (a) + (69) 


and which, as Cramér has shown [Cramér, 1928], really gives an asymptotic 
expansion in powers of N~?, with a remainder term of the same order as the 
first term neglected. The parameters y, and y, which are given by 


71 = Eg x* = 6 (0) 
Yo = Hp x* -3=6 (0), 
are usually called coefficient of skewness and coefficient of excess, respectively. 


It is interesting to notice that the expansion (69) can also be considered as an 
expansion in orthogonal Hermite polynomials. 
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On the same assumptions a corresponding expansion may be obtained for the 
frequency function for the variables a'*!(¢), ... , a™(C). The zero order term is 
the Gauszian frequency function determined by the mean values (65) and the 
covariances (66). As a result of (67) it has the form 


m-l il eC ye 4 
) i l 4 ; 
(antes Det. = Et hor ls ny xX 
AN OMe Sea) 4 
A, =m oe Y (71) 
Ree l a 
AXP \— = DD a (a a iati= a"). 
2H Apal41 da OK 
In particular for 1=0 we have Y,= —/S and 
ies t 
—n/2 
Ca) ve u=1 mp > 
xOKOu 
A, u=m a S (72) 
x exp 4 += a area (a* — a*) (al — al"), 
Lee Aad OO ONC 
If the / intensive variables B,,..., B; and the n—m extensive variables 
ae : one : ¢ 
A”, ..., A” are fixed, the remaining free extensive variables A'*’, ..., A™ are 


° A = he . 6 
distributed around their mean values «'*?, ..., %” under the given constraint, 


which is assumed to determine a thermodynamic state, according to the prob- 
ability distribution 


C exp {-24 Y,(A a| dita 7 \ee od WA): (73) 


Here C is a normalization constant, i.e. independent of Co nes aC Ney (Nea) 
is formally equal to the first two terms of the Taylor expansion of 


1+ S| r 12 141 +1 
TU ADas cas 9 Piacl? Re eg exe Be NS LAA Pik Ge I eee) 


in powers of the deviations from the means of the free variables A'**,...., A™. 
As was demonstrated in paragraph 9, Y, is the appropriate thermodynamic 
potential for the given type of constraint, and has a minimum in the corre- 
sponding thermodynamic state. All derivatives of the first order vanish and thus 
(73) is seen to be only a particular case of (71). 

Corresponding to (72), J=0, Y,=—S, we obtain from (73) a version of Boltz- 
mann’s principle: ““Entropie = log (Wahrscheinlichkeit)”. Here “Entropie’’ is to 
be interpreted as the first non vanishing term in the Taylor expansion of AS. 

Objection could be raised against the way in which we have taken account 


of the constraint A”*!=¢"*!, ..., A"=«" for the determination of the distribu- 
tion (73). We have simply used the marginal distribution of the A™’, ..., A” 
with parameters f,, ..., 8, or equivalently p,, .... 61, att, ..., a”. In some situa- 


tions it would be more natural to consider not the marginal distribution but 
the conditional probability distribution of the A'*!, ..., A™. The condition should 
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be of the form A”*1=a"*1, ..., A"=a™. However, due to the fact that the 


distribution of A!, ..., A” is Gauszian we obtain in this way the same distribu- 
tionptoteA “=, utes ali}: 


11. Asymptotic expansion of y (a) 


We shall assume that we are dealing with a system of such a nature that 
an expansion of the Massieu-Planck function in powers of the inverse square 
root of a dimensionless size parameter NV is meaningful. We shall further assume 


1 
that the measure V (da' ... da") on R” has a density (13) exp {ovat 


Given the constraint B=, the probability density for the variables A‘, ..., A” 
is, without approximation, given by 


1 1 os 
exp ag Cae VE Gani, a‘j}= exp ‘: E (a) — S (a) oR (a> — 2} (Ui) 


Corresponding to the asymptotic expansion of the probability density we may 
write (75) in the form 


li gees 
i be Vga" s v Y s 
exp {2 Dbataa® *)@—« he (x) {1 +k (a, a}, (76) 
—nj2 —e S 2 
where Qa De. ((e/ zi) 
Oa" da’ 


and the function k(a, «) is a power series in N~?. Combining (75) and (76) we 
obtain the following expression for q (a) 


os Lee 
(C= aa 


Oo\=S 
7 (a) (2) Note 20a On” 


(a* — a) (a — a") + 


+ log C (a)+ log {1+k(a, a)}. (72) 


We recall that the terms in k(a, «) are n-dimensional Hermite polynomials with 
coefficients of definite orders in N~?. The first term of the expansion of (78) 
in powers of N ? corresponding to k (a, «)=0, is 


Seow eS Nee 
Po (4, Cast A 4 Marmara oe 5 jahoor te % (a =o = log Ca). (79) 


We can replace g(a) by (a, «) with an error of the order N~? which is of 
course not uniform in a and «. It might be remarked that the usual derivation 
of this result in the case n=1, A’= the energy, e.g. [Gibbs, 1902] is purely 
formal and can give no estimate of the error. 

The assumption that the exact form of @ (a) is 


P(A) = Pot Pua! + Py a" a” (80) 
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with constant coefficients @, Qu, Qu» is strictly incompatible with the basic 
assumption that 


n 1 
exp 77 (2) DO 240 = 4 cox (81) 


Rn 


Thus there is no system for which (80) is exactly valid. It has meaning only 


as an asymptotic result when, with «“ given by My» «= = Dis 
» . ip: 1 : na I I Le a) 
FP ag Fe CD Pomme ea Pe te Oe (82) 
APPENDIX 


The mathematical structure of the generalized canonical distribution 


In this appendix the generalized canonical distribution of paragraph 4. is dis- 
cussed in a less heuristic manner. A connection between the possible values of 
the phase functions and the possible mean values of the generalized canonical 
distribution, which generalizes results of Khinchin and Grad, is established. We 
shall take Boltzmann’s constant x=1. 

To every extensive variable A” (paragraph 2), there corresponds a real, meas- 
urable function a“ (f) of the phase. We denote by 


a(C)= {a (C), ..., a" (C)} (A 1) 
the corresponding measurable function from Z into the n-dimensional real 
space R”. 

We shall assume that the functions a (¢) are linearly independent. That is, 
if for some real constants Ay, A,, --., An 


dy +A," (¢)=0 (A 2) 


holds for all points of the phase space Z with the possible exception of a set 
with w-measure zero, it follows that 


Ip = Ay = 0 = Ay =0. (A 3) 


If this condition is not satisfied we may reduce the number of extensive vari- 
ables so as to obtain linearly independent phase functions. 
The functions a“ (¢) being measurable, the integral 


 (B)= J exp {-Bua" ()} (dC) (Ad) 
Z 
is defined for arbitrary real parameters (f,, ..., Pn) =/ and 
0<O(f)< +0. (A 5) 


As the integral of a convex function, ®(f) is itself a convex function of the 
parameters f,. That is, if 0<p<1 and f1, B2€R" 
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O [pB1+(1—p) B2)<p® (p1)+(1—p) ® (B62). (A 6) 
It follows that every n-dimensional point set of the form 
{B® (B) <c¢} (A7) 


with ¢ an arbitrary real number, if non empty, is convex. 

We shall assume that the integral @(f) is finite if B belongs to the, neces- 
sarily convex, open subset D; of R”, and infinite otherwise. In the one dimen- 
sional case of the ordinary canonical distribution one usually assumes that the 
integral J exp {—B8H(l)} (dC), H(¢)= the Hamiltonian, is finite for all values of 
B in the open interval 0<f< oc, [Khinchin, 1949]. 

The function a(¢) and the measure « (d¢) induce a measure on the (Borel) 
subsets of the n-dimensional real space R": Let B be a (Borel) subset of R” 
and B’ the corresponding subset of Z, €€B’cZ if and only if a(C)EBC AR’, 
and define the measure V(-) in R" by 


V (B)="(B’). (A 8) 


Evidently V (R")=(Z)= +00. The integral (A4) over Z can be written as an 
integral over Rk”: 


® (8) = J exp {—8,a"} V (da ... da”), (A 9) 


Rn 


and the integral is convergent if 6 € Ds. 

The measure V (J) of a finite interval is finite. This follows from the fact 
that the integral (AQ) is finite for some value of 6. Thus the integral over I 
is finite, and the integrand has a positive lower bound on J. Hence V (1) < co. 
The assumption that the integral (A9) is convergent for some value of f is 
thus a considerable restriction on the class of admitted phase functions a (€).1 

The support F of the measure V (da) is defined as the minimal closed subset 
of R” with the property V(R"—F)=0. That is, F consists of the possible values 
of the function a(¢). Hence any integral over R” with respect to V (da) can 
be replaced by an integral over F. 

The assumption that the » phase functions a’ (C), ..., a"(f) are linearly in- 
dependent implies that F contains at least (n+1) linearly independent points. 
Thus F is not contained in any linear subspace of R" with dimension less than 
mn. It follows that the convex hull F* of F has the same property and thus 
F* contains an open subset, or equivalently: the interior of the convex hull 
of the support of V (da), F*°, is not empty. 

For every value of the parameters 6 in the open region D,, the function 
(6) has derivatives of all orders. This can be shown in the same manner as 
it is demonstrated e.g. in [Widder, 1946] that the one-dimensional Laplace- 


‘ In the quantum mechanical case the assumption would e.g. imply that the spectrum 


of the operators corresponding to A is discrete, and that each eigenvalue has only a finite 
degeneracy. 
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Stieltjes transform has derivatives of all orders in the interior of its region of 
convergence. With — £,, in oo replaced by the complex variable 7z,, the integral 
defines for Im {z}€Ds, Re {z} arbitrary, an analytic function OT ve ror the n 
Ware bles.2) 7.452; 

The Massieu-Planck function Y (8) is defined by 


— Y (B)= log ® (f) (A 10) 
and hence from (A 5) 
—co <Y(B)<oo, (A 11) 


Like ® (8) the function Y (8) is convex. 
For complex argument, Im (z)€D,;, we define Y (—iz) by 


Jim | Y(—t2)— 7 (8) — YP (22 )i= los © (12): (A 12) 
Re {2}! 
Im {2}=,8 


The generalized canonical probability distribution in R" characterized by the 
parameter values / in Dz; is defined by the measure 


exp {Y (8)—f,a"} V (da! ... da”). (A 13) 
With Im {z}=8, Re {z}=é 


® (—iz) 


© (8) = exp {Y (8) — Y (—12)} = exp {Y (8) — Y (B—74)} (A 14) 


is the Fourier transform (characteristic function) of the probability measure 
(A 13). By the inversion theorem for the Fourier transform we have for an 
arbitrary bounded Borel subset B of Rk” 


| exp {Y (8)—B,a"} V (da ... da") = f{@ay Pa exp {-7&,a"} x 
. mS ep (A 15) 
de... da") exp iY (B)—Y (B-vE\i dE, ... dé, 


This is equivalent to : 


ae ee "J exp {-3 (E,#1B,\a*tda’ .. da”|x 
(A 16) 


x exp {— Y (@-716)} dé, ee AES 


or V(B)= i, [(2)-” [ exp {-i2z,a"—Y(—iz)}da* ... da"]dz, ... dz, 
B 


Re {z}e Rn 


which holds for every value of Im {z}=f in Dg. 
The first order derivative of the Massieu-Planck function 


vu. uHLHORN, Macroscopic observables 


= i exp {Y (f)—f, a} a" V (da 00"), (A 17) 


Rn 


is equal to the mean of the probability distribution (A 13). 
The second order derivative of Y (/) 


= } exp {Y (B)—pia’} (a“ — a) (a”— a") V (da‘s.. da*\a, (Ams) 


Rn 


is equal to the second central moment of the distribution (A 13). The corre- 
sponding real quadratic form is positive 


a (8) Ap Ane 0 (A 19) 
and definite (according to (A2) and (A3)): 
ot” (B)A,A,=0 implies A,=--- =A, =0, (A 20) 
otherwise we would have with non vanishing (A,, ..., An) 
Me (a*—a")=0 


for every a in F with the possible exception of a subset with measure zero, 
contrary to the assumption of linear independence. 

The “possible values” of the phase functions a“ (¢) form the closed set F, the 
support of V(-). The mean values «(6) of the generalized canonical distribu- 
tion (A 13) must for any B€ Dz, be contained in the convex hull, F*, of F. We 
shall prove that conversely every value « in the interior of F*, that is in F*°, 
is the mean value of a uniquely determined generalized canonical distribution. 
A corresponding theorem has been proved by Khinchin, [Khinchin, 1949], for 
the case of the ordinary canonical distribution (one function a” (¢), bounded from 
below, and corresponding to the energy). A generalization of Khinchins theorem 
to several phase functions has been given by Grad, [Grad 1952]. Grad restricts 
himself to functions a“ (f) of the phase, €=(q; ... des Py --» Pr) Which induce a 
measure V(-) in R” of the form v(a)da' ... da", with a continuous density 
v(a). His proof is incomplete. 

The theorem which we shall prove may be given the following precise for- 
mulation: 

If «* is a point in the interior of the convex hull of the support of the 
measure V(-), «€F*®, the n equations 


OF ir A ete 
Re Goes or 5 LY Sia Gal 0; (A 21) 


le 


Y (B) being defined by (A 10), (AQ), are satisfied for exactly one value of / con- 
tained in the convex, open set Dz. 
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For the proof we consider the centered phase functions a“ (¢)—o*" which in 
the following we shall denote by a“(¢). Thus it is no restriction to assume that 
the origin 0=(0,...,0) is an interior point of F* and to consider instead of 
(A 21) the equations 

GY 
OB y 


(B)=0, BEDz. (A 22) 


This, on the other hand, is the condition for a stationary value of Y (f) at the 
point Pp, which as a consequence of (A19) must be a maximum. The theorem 
is proved if it can be shown that Y(f) has a unique maximum or equivalently 
that ® (f) = exp { — Y (f)} has a unique minimum in Dy. Denoting by ®, the infimum 
of ©(8) in Dz we must show that there exists exactly one point 8 in Ds, for 
which ® (f)=Q,. 

The n-dimensional f-point set 


{B| ® (8) <®) +e} (A 23) 


where ¢>0 is arbitrary, is evidently contained in Dg and is non empty. As a 
consequence of (A6) the set is convex and due to the fact that ®(f) is con- 
tinuous in Dz it is closed. If the set (A 23) is also bounded it follows that for 
some point /* in the set @(f*)=Q). 

In order to show that the set (A 23) is bounded we shall first prove that if 
f is an arbitrary fixed point in Dz, y+0 an arbitrary fixed point in R” and 
t a positive variable, the continuous function @(f6+ty) of t increases infinitely 
with ¢. For sufficiently small ¢t, (6 +ty)€D,;, and D(6+ty)<oo. If ¢ increases 
there are two possibilities. Either (f+ty) reaches for some finite value of ¢ 
the boundary of Dg or ®(f+ ty) remains finite for all values of ¢. We shall 
see that also in the latter case the value of D(/+ty) increases infinitely with t. 
The integral defining ®(f+ty) gives rise to the inequality 


© (B+ty)= f exp {—-ty,a"} exp {—f,a"} V (da' ... da")> 
Rn 


2 J exp {-ty,a“} exp {—f, a"} V(da’ ... da")> (A 24) 


—y, > d>0 


i J Res Be 1 n 
> exp {td} i exp {-—8,a"} V (da ... da”). 
4, > 


The inequality (A 24) holds for any 6>0. To see that O(f+ty) increases in- 
finitely with t we have only to show that the last integral is non vanishing for 
some 6>0. The integrand being positive, this is equivalent to V {a|—y,a"> 0} > 0. 
According to our assumption 0=(0, ..., 0)€#*°. Now F*° is an open set. Hence 
there exists a 6>0 such that F*°N {a|—y,a">6}+¢. It follows that FN {a i 
—y,a">6}+¢. But this implies, as the set {a|—y,a“> 6} is open, according 
to the definition of F, that V {a|—y,a">6}>0, for otherwise we would have 
Fn {a|—y,a">6}=¢. Thus (A24) it scen to imply that M(B +ty) increases 
infinitely with f. 


253 


U. UHLHORN, Macroscopic observables 


This is sufficient to assure that the set (A 23) is bounded, for taking an 
arbitrary fixed point f in the set its distance to the boundary in an arbitrary 
direction is a continuous function of the direction (i.e. a continuous function on 
the unit sphere). This distance function assumes its maximum value for some 
direction. Thus the set is contained in a sphere with finite radius and center 
in fp. 

Hence it has been established that the function assumes its minimum value 
in some point of D;. Thus the equations (A 22) and consequently (A 21) can be 
satisfied. The uniqueness of the solution is a result of (A20) and has been 
proved in paragraph 7. 

We have found that for arbitrary «*€ F*° the function Y (8) —f,,«*" has a unique 
maximum for f= * where f* is determined by 


—— (B*)=a** (w=i,...,2). (A 26) 
Hence 
Y (B*)— Bi a**= Y (B)—By at" (6 arbitrary € Ds). (A 27) 


Introducing the entropy function S(«) (42), (43) we obtain 


OS (a) OS (a) 
aa S * =~ wes = a 
(a je Cal! x S (a) Cc! x ? 
or (A 28) 
: os 
S (a) — 8 (a*)= (2) (cc! — oc") 
Oa! 


The last inequality (A 28) holds for arbitrary « and «* in F*°, so that by the same 
argument 


aS (at 
S(a*)—S(a)= eee att — a (A 29) 
and hence 
08 (a*) Z os 
Far lata] & 8 (at) —S(a) 2 SO pan — a, (A30) 
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